Method and apparatus for recommending selections based on preferences in a multi-user system

ABSTRACT

A system for determining selections that a user is likely to be interested in. The determination is made based on the user&#39;s prior indicated preferences. The user designates his or her preferred selections as entries in a user&#39;s preference list. Entries in the user&#39;s list are compared with entries in the other users&#39; lists. When a significant number of matches have been found between two lists, the unmatched entries of the other user&#39;s preference list are extracted. The unmatched entries are further processed. Those unmatched entries with a high correlation to the user&#39;s preference list are presented to the user as selections in which the user is likely to be interested.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. patent application Ser. No.09/874,563 filed on Jun. 5, 2001, which is a continuation of U.S. patentapplication Ser. No. 08/707,773 filed Sep. 4, 1996, which is acontinuation of U.S. patent application Ser. No. 08/119,793 filed Sep.9, 1993 (now issued as U.S. Pat. No. 5,583,763), all entitled METHOD ANDAPPARATUS FOR RECOMMENDING SELECTIONS BASED ON PREFERENCES IN AMULTI-USER SYSTEM, with the named inventors John Atcheson and James R.Miller, III, all of which are hereby incorporated in their entireties byreference.

BACKGROUND OF THE INVENTION

The generation, replication and transmission of information by automatedtechnology has far surpassed a person's ability to keep up with theinformation. Not only is this true in the business world, but ourleisure activities also involve sorting through an overwhelming numberof choices and making intelligent selections. For example, there is anabundance of entertainment in digital-audio or digital-video form, suchas compact discs (“CD's”), mini-discs, digital audio tape (“DAT”), laserdiscs, computer graphics, high-definition television (“HDTV”), etc. Theaccelerating ability of communication systems to convey this informationvirtually instantaneously means that we have access to many thousands ofselections.

The trend is for computers to be integrated with household devices, suchas telephones or televisions, giving rise to “smart phones” or“intelligent” or “interactive” televisions. A telephone with computercapability could be used to “download” digital information,representing, for example, music selections, into the memory of thetelephone for later playback. Alternatively, the downloaded informationcould be written to a medium such as a writeable compact disc. Thus, thesmart phone would be capable of downloading high-fidelity music onto aCD that could be played at the user's leisure on his or hersophisticated stereo system. Since the telephone is connected to a vastnetwork, such a system would provide an efficient method of exchange ofsound information. In effect, a user could, for example, “dial-up” acentral processor and employ a push-button phone or other input deviceto request music by the user's favorite recording artist and have itdownloaded into the user's home. A similar setup is possible for visualimages downloaded to an interactive television via, e.g., a cablenetwork hat connects the interactive television to a central site wheremovies are stored and available for downloading.

However, one problem with these systems is that it is often difficultfor a user to keep abreast of all of the possible selections for soundand image data, that is, current releases by a favorite recording artistor current movies that might be of interest to the user. The speed withwhich new entertainment selections are provided, not to mention the pastdecades of audio and visual works already in existence, results in ahuge number of selections that is ever-growing.

One way for a user, or subject, to make a selection from a largecollection of digital “objects” is to allow the user to input selectioninformation into a network. The selection information is then sent tothe central or host processor which categorizes the types of objects,e.g., music or video selections, that are of interest to the user andpresents the user with these categories of selections. However, thecategories are usually broad.

For example, in the music realm a user might make selections in thecategories of “country music” or “jazz.” These categories are extremelybroad, each encompassing many thousands of recordings. On the other handa user might indicate a specific artist, although this is extremelylimiting in that only recordings by that specific artist can logicallybe associated with the user's selection. Also, this method of having theuser provide category selections places the burden of keeping track ofnew developments on the user himself. The categories may change, forexample, rock music has split into various other categories such asheavy-metal, modern, and alternative. Category selection is alsolimiting to the user since it can only provide what the user knows howto ask for.

SUMMARY OF THE INVENTION

The present invention provides an efficient means for presenting a userwith recommended selections from a large number of possible choices. Theinvention is applicable to any type of information “objects” that can beordered according to the preferences of a user, or “subject.” Preferenceinformation from the users is obtained via the network and therecommended selections are communicated to the users over the network.In a preferred embodiment of the invention, the objects are digitalsound recordings.

In a first embodiment, the invention comprises a method executing on acomputer system which includes a processor, database, an input device,and an output device. The database includes a plurality of data files,each data file containing a plurality of preferences. A target userinputs signals to indicate a set of preferences. The database issearched to determine the number of user preferences that matchpreferences in the data files. If the number of matched preferences fora given data file is above a predetermined threshold, the non-matchingpreferences in the data file are output as recommended selections to theuser.

Thus, the invention operates on the assumption that if two people havesimilar tastes in music, it is likely that the preferences of one willbe desired by the other.

A second embodiment of the invention uses a computer system including aprocessor, database, input device and an output device. The processor isused to store pairs of ranked objects in the database. A ranking numberis associated with each of the pairs. A user of the system selects anobject, and occurrences of the selected object in the pairs aredetected. For each pair in which the selected object occurs, the objectin the pair that does not match the selected object is determined. Theprocessor is used to order all non-matching objects into a listaccording to the ranking number for the non-matching object's pair.Finally, the ordered list is output as a list of recommended objects.

Various parameters of the systems described above are adjustable. Theseparameters include the size of the list of preferences associated witheach user and the number of matches required between two users beforepresenting the matching user with the matched user's unmatchedselections. Also, the manner of weighting and ordering unmatchedpreferences is adjustable.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a portion of a network used to transfer informationin the present invention;

FIG. 2 is a block diagram of the host processing station of FIG. 1;

FIG. 3 shows a flowchart of a routine for correlating user preferencesin the present invention;

FIG. 4 is an illustration of a specific example of carrying out thesteps of the flowchart of FIG. 3;

FIG. 5 shows a detailed example of the execution of steps in FIG. 3;

FIG. 6A is an illustration of a first half of a flowchart describing aroutine of the present invention;

FIG. 6B is an illustration of a second half of a flowchart describing aroutine of the present invention; and

FIG. 7 shows a table illustrating the method described in Table 2 andFIGS. 6A and 6B.

DESCRIPTION OF A PREFERRED EMBODIMENT

FIG. 1 shows a portion of a network 100 used to transfer information ina preferred embodiment of the present invention. In FIG. 1, network 100includes host processing station 102 that is the storage center forinformation to be distributed, such as sound or image information.Regional hosts such as regional host 104 are connected to host 102 viacommunication links such as link 106. The regional hosts are used toserve users of network 100.

For example, regional host 104 is connected to user terminal 108 throughuser link 110. The regional hosts act as “front ends” to host processingstation 102. A regional host performs the input and output (I/O)functions for each of the multiple user terminals connected to it. Theregional host may include a large amount of local storage for containinginformation such as user selections, user preferences, sound and/orimage information, as discussed below, similar to host processingstation 102.

FIG. 1 is illustrative of but one configuration of a network toimplement the present invention. In FIG. 1, any interconnectionarrangement between host processing station 102 and the regional hostsis possible. For example, host processing station 102 can be connectedto regional hosts via a star, ring, bus, or other connection scheme.Links such as link 106 can be by hardwire connection, fiber optics,radio frequency communication, etc. Regional hosts can be stand-alonecomputer systems connected to mass storage devices, or can be networksin themselves. The user links such as user link 110 can be achieved byany means known in the art. A preferred implementation of user links isby existing telephone or cable television networks. User terminals suchas user terminal 108 can be a personal computer, or telephone ortelevision with digital transmit/receive capability.

In a preferred embodiment, the central host processor station includesone or more computers connected to mass storage devices such as multipledisk drives. The link between host and regional host is by hardwire,fiber optic or satellite transmission, as available. A regional hostoperates as a communications control station between hundreds, orthousands, of user terminals and the host processing station. Theregional host has local storage that serves primarily as a buffer forI/O. However, many of the functions discussed herein regarding the hostprocessing station also can be performed by the regional hosts, asdesired.

In a preferred embodiment, the present invention is adapted to a musicselection and distribution service that provides users with recommendedselections of songs that have a high likelihood of suiting their tastes.This system uses the user's household telephone to accept input in theform of touchtone keypad key presses to allow the user to selectrecordings from a menu and have the recordings delivered to the user'shome. Delivery of the recordings can be by traditional methods such assending a compact disk (“CD”) through the mail, or by more advancedmethods such as “downloading” digital information to a user's computerfor playback either from the computer or through a high fidelity homesound system.

For example, the downloaded digital information could be used to createa CD on a writable CD disk drive attached to the computer.Alternatively, the digital information could be written to a digitalaudio tape (DAT) format. Also available today is a “mini disk” formatwhich is a writable format. Other possibilities for transferring digitalrecording information to allow the user to play it back at their leisureare possible. Thus, one aspect of the invention provides a userinterface for making selections from the user's home.

To continue with the music selection service example, a user dials anumber and is connected into a network to which the host processingstation is connected. The user is given a menu of choices for proceedingwithin the service. One option is for the user to define “preferences”,or a list of their favorite artists. For example, a list of the top tenfavorite recording artists is maintained for each user of the service. Auser is given the opportunity to define or modify the list of top tenfavorite artists by using the buttons on their telephone or by operatingthe keyboard or mouse or other input device on their computer. The usercan also access a list of recommended selections which the service hasdetermined might interest the user. The manner in which the suggestedselections are generated is described in detail, below.

Another option the service provides for the user is the ability topreview recordings before the user downloads them. That is, the user canhear a short segment of the musical recording over the telephone beforedeciding to purchase and download the recording. The preview can be ofsuggested selections provided by the service, selections the user hasheard about and can identify by name, or selections of a specificartist, band, or style of music (e.g., country, jazz, rock).

The present invention uses the user's preference list to Identifyselections that would most likely be of interest to the user. This isdone by correlating the user's preference list with the preference listsof other users of the service. The correlation is performed in anefficient manner since all of the preference list information ismaintained at the centralized host processing station discussed above inconnection with FIG. 1. While it is possible that the correlation ofuser preferences can be done at the regional host level (to takeadvantage of geographical dependencies in user preferences, forexample), it is advantageous to include as many users as possible in thecorrelation in order to achieve a more reliable correlation.

FIG. 2 is a block diagram of the host processing station 102 of FIG. 1.FIG. 2 shows host processing station 102 including an I/O controller120, processor 122, memory 124, operator interface 126, and storagedevices at 128.

I/O controller 120 transmits and receives signals from the regionalhost, such as regional host 104 of FIG. 1. I/O controller 120 transfersinformation to and from processor 122. Processor 122 is connected tomass storage devices at 128. Processor 122 is also connected to memory124. Memory 124 is typically solid state memory for fast random accessand is used, for example, to perform buffering functions, intermediatestorage for mathematical calculations, database functions, etc.

Operator interface 126 is connected to processor 122. The operatorinterface 126 provides a human interface at host processing station 102.By using the operator interface, one can provide better correlations byvarying the parameters defining how the correlations are made to userpreference lists. The operator interface may be implemented by any meansknown in the art, such as having the operator type in numbers forvarious parameter values. The purpose of allowing an operator to adjustparameters from time to time is to “tune” the system, based on a trialand error basis, to achieve the best results. This is necessary sincethe goal of providing accurate suggested selections is a subjective one.

It will be apparent to one of ordinary skill in the art that manyconfigurations for the host processing station 102 that vary from thatof FIG. 2 are possible. Specifically, additional hardware components(functional blocks) may be present in host processing station 102 suchas tape drives and additional processors. Any suitable computer systemmay be used.

Table I and FIGS. 3-5 will be discussed in connection with anexplanation of a specific embodiment of a correlation method.

Table I lists the steps to perform “Subject to Object” correlation togenerate suggested selections based on a comparison of a user'spreference list with the preference lists of other users of the service.The steps of Table I are discussed in reference to FIGS. 3-5 where adetailed discussion of a method for performing the correlation asexecuted by a computer is discussed.

TABLE I Subject to Object Correlation Two or more persons list theirfavorite objects 1 . . . n. To generate recommendations for any oneparticular person (the “subject”), the following procedure is followed:Search the database of all other people to see if anyone has listed n −1 of the same objects. Each match will generate one object that was notlisted by the subject. Calculate the total number of occurrences foreach such object and create two rank tables. In the first table, rankfrom maximum to minimum the absolute number of occurrences for eachobject. In the second table, rank from maximum to minimum the ratio ofthe number of occurrences for each object divided by the total number ofoccurrences for that object throughout the entire database (i.e., rankthe “normalized” number of occurrences for each object). For eachobject, compare the rankings between the first and second tables and usethem to create a third table. The third table is comprised of a weightedaverage of the two rankings. The weighting of the rankings can rangefrom 100% of the first ranking and 0% of the second to 0% of the firstand 100% of the second, depending on the level of weighting desired, andprovided an identical level of weighting is applied to all objects. Theresults in the third table are then ranked from maximum to minimum. Ifn > 2, then search the database of all other people to see if anyone haslisted n − 2 of the same objects, where “n − 2” is the “matchup” number.Each match will generate two objects that were not listed by thesubject. Calculate the total number of occurrences for the unlistedobjects. Repeat the above procedure similarly for all values of thematchup number down to, and including, 1. Using the ranked weightedaverage (i.e., the “third”) tables for n − 1 . . . n − (n − 1) searches,create a final table. The final table is comprised of a weighted averageof the weighted average rankings for n − 1 . . . n − (n − 1) objectssearches. The weighting of the rankings can range from 100% of the n − 1table and 0% of the other tables to 100% of the n − (n − 1) table and 0%of all other tables, depending on the level of weighting desired, andprovided the weighting of all tables combined equals 100%. Rank theresults in the final table from maximum to minimum. Use this finalranking to provide recommendations to the subject ranging from mostlikely to enjoy (the maximum ranked object) to the least likely to enjoy(the minimum ranked object).

FIG. 3 shows flowchart 200 of a routine for correlating user preferencesin the present invention. In a preferred embodiment, the routinecorresponding to flowchart 200 of FIG. 3 is executed on a computersystem, such as host processing system 102 of FIG. 2.

Flowchart 200 can be implemented with one or more software routines onone or more computers. Further, flowchart 200 illustrates merely thebroad logical flow of a routine to accomplish the correlation procedureof the present invention. Other flowcharts implementing suitablecorrelations will be possible while still remaining within the scope ofthe present invention. A software routine, or routines, implementingflowchart 200 may be written in any suitable computer language, such as“C”, FORTRAN, assembly language or others. For ease of discussion, wewill refer to the routine of flowchart 200 as a single routine.

The routine is entered at step 202 where it is assumed that a userpreference list exists to which a correlation is to be made with otherexisting user lists in a computer database. The database could exist,for example, in memory 124 of FIG. 2. In step 204, a “reference list” isinitialized with n user preferences. In this example, we assume thatuser preference lists contain five entries each. Thus, at step 204 ofFIG. 3, the reference list is initialized with five entries of a userpreference list to which a correlation of preexisting preference listsin the database is sought. In other words, the routine of flowchart 200will provide selections in the database that have a high likelihood ofbeing desired by the user whose preference list has been submitted tothe routine.

At step 206, the variable m is initialized to n−1, that is, m now hasthe value 4 since n is assumed to be 5 as discussed above. Also at step206, a “temp. list” is cleared to be used by the routine as describedbelow.

At step 208, a check is made to test whether m is greater than 0. If mis greater than 0, execution proceeds to step 210. In the presentexample, since m has just been assigned the value 4, the check at 208 istrue and execution proceeds to step 210.

At step 210, a check is made as to whether there are predefinedpreference lists in the database that have not yet been checked againstthe reference list. Since we have just entered the loop, none of thepreference lists in the database has been checked so execution proceedsfrom step 210 to step 212. At step 212, the next preference list isobtained from the database and made the “current list.” At step 214, acheck is made as to whether the current list has m entries that matchthe reference list. In other words, since m has the value 4, there mustbe four entries in the current list that match four of the five entriesin the reference list. Since the reference list contains the userpreferences, this means that step 214 of flowchart 200 is attempting tomatch preexisting preference lists in the database that have at leastfour entries in common with the user preferences. If a match is found atstep 214, execution proceeds to step 216 where the unmatched entries inthe current list are included in a “temp. list.” In our example of mhaving the value 4, this would mean that one entry in the current listis not matched in the reference list. This one unmatched entry is addedto the temp. list. If the entry already exists in the temp. list, then acount of the number of occurrences of the entry being added to the temp.list is incremented.

After executing step 216, the routine returns to step 210 where a checkis again made as to whether there are more unused preference lists inthe database. If there are, execution continues to step 212 where, asbefore, the next list is obtained and checked for four matches againstthe reference list. If there are four matches, then the unmatched entryof the current list is added to the temp. list and so on. However, if,at step 210, there are no more lists in the database to be checked,execution continues to step 218 where the entries in the temp. list areordered according to their counts. This means that entries with highcounts are placed higher an the list, “List 1,” in step 218.

At step 220, a second list, “List 2,” is created in which the entriesare ordered according to their normalized counts. As before, in formingList 1, the entries are obtained from the temp. list and are orderedaccording to their frequency of occurrence as unmatched entries in thecurrent list in the loop of steps 210-216 described above. The method ofnormalizing the counts is described in more detail below. At step 222,List 1 and List 2 are weighted and combined to generate a “selectionlist m” where m is an integer value. In the present example, since m hasthe value 4, List 1 and List 2 will be combined to generate “selectionlist 4.” At step 224, m is decremented (i.e., m now equals 3) and the“temp. list” is cleared. Execution then returns to step 208.

At step 208, a check is made as to whether m is greater than 0, asbefore. Since m is now 3, execution proceeds to step 210 where the loopof steps 210-216 is executed to find matches in preference lists in thedatabase that have three entries matching with the reference list.

After all of the preference lists in the database have been checked, theloops of steps 210-216 produce a temp. list with the unmatched entriesof preference lists in the database that have three or more entriesmatching the reference list. At step 218, the entries in the temp. listare ordered by their counts as discussed above and stored as List 1. Atstep 220, the entries are ordered in List 2 similarly to step 218,except that the normalized counts are used to order the entries. At step222, List 1 and List 2 are weighted and combined into a single list“selection list 3” (since m has the value 3 here). At step 224, m is nowset to the value 2 and the temp. list is cleared so that the loop ofsteps 210-216 will be reentered for the case of m being 2. Thus,selection list 2 and selection list 1 will be formed with an orderedlist based on preference lists in the database that have, respectively,2 and 1 entries matching the reference list.

After m has been decremented to 0, the check at step 208 will result infalse and execution will continue to step 226 of FIG. 3. At step 226,the selection lists 4, 3, 2 and 1 are combined to form a single list.Discussion of the details of the weighting and combining of theselection lists is presented below. At step 228, the highest rankedselections of the weighted and combined selection lists generated atstep 226 are presented to the user as the result of the correlationroutine of flowchart 200.

Finally, the routine is exited at step 230.

Next, FIG. 4 will be discussed to show a specific example of carryingout the steps of the flowchart 200 of FIG. 3.

FIG. 4 shows preference lists, such as user's preference list 300. Asshown in FIG. 4, user's preference list 300 has entries a, d, p, s andy. Database preference lists are shown at 302. For example, databasepreference list 304 includes entries b, p, s, a and d. In a preferredembodiment, entries designated by lower case letters in FIG. 4 willactually be the names of recording artists. Also, the number ofpreference lists in the database will be very large, on the order ofthousands or even tens of thousands of preference lists. N, the numberof entries on a user's preference list, is given the value 5 for thisexample. The specific numbers used here are merely values used for easeof discussion. In practice, all of the parameters, such as n, m, thenumber of preference lists compared to in the database, and the“weights” to be discussed below, are variable. These parameters, areused to adjust the correspondence routine of FIG. 3 to achieve optimalperformance. In other embodiments of the invention, different parametersmay be used.

Thus, referring to flowchart 200 of FIG. 3 and the preference lists andvalues of FIG. 4, the following discussion tracks the specific exampleof FIG. 4 in association with the steps of flowchart 200 of FIG. 3.

Step 204 of FIG. 3 calls for initializing a reference list with the nuser preferences. Thus, the reference list (an internal list manipulatedand accessed by the processor executing the routine of flowchart 200)will be identical to user's preference list 300. That is, the referencelist will have entries a, d, p, s and y in that order. At step 206, mobtains the value 4 (i.e., 5−1) and the temp. list is cleared. At step208, m will be greater than 0 so that the loop of steps 210-216 isentered to compare the reference list with preference lists in thedatabase, such as those shown at 302 of FIG. 4.

In FIG. 4, user's preference list 300 consists of {a, d, p, s, y} whiledatabase preference list 304 consists of {b, p, s, a, d}. Therefore,there are four matching entries, namely, a, d, p, and s. The singleunmatching entry in database preference list 304 is entry b. Thus, theentry b is entered into a temp. list (not shown) as occurring once sofar. The next time through the loop of step 210-216 of FIG. 3, the nextdatabase preference list 306 is compared with the user's preference list300. The matching entries are d, s, and p. Thus, it is seen thatdatabase preference list 306 only has three matching entries with user'spreference list 300. Therefore, at step 214 of FIG. 3 where a check ismade for m, or at this point 4, matching entries the check will fail andexecution will proceed to step 210 without executing step 216.Therefore, none of the entries of data preference list 306 will beinserted into the temp. list.

Next, database preference list 308 is compared with user's preferencelist 300. This gives entries y, a, p and s as matching entries. Sincethere are four matching entries, the unmatched entry, b, is again addedto temp. list. Since b already exists on the temp. list, the countassociated with entry b is incremented to 2. The loop of steps 210-216continues until the last database preference list 310 is compared touser preference list 300. Database preference list 310 has entries a, d,s and p as matching. This results in the unmatched entry q of databasepreference list 310 inserted into the temp. list. Finally, there are nomore database preference lists to be checked and execution proceeds tostep 218 of FIG. 3.

At step 218 of FIG. 3, the entries in the temp. list are orderedaccording to their counts and stored as List 1. FIG. 4 shows List 1 320with entries from the temp. list at 322 and the associated countsadjacent to each entry at 324. Thus, List 1 has entry b with a count of1080, entry z has a count of 962, entry c has a count of 220, entry qhas a count of 58, entry t has a count of 7 and entry u has a count of1.

At step 220, the entries are ordered by their normalized count stored asList 2. In order to calculate the normalized count value, the number ofoccurrences of each of the entries in the temp. list (i.e., the entriesat 322 of List 1, although in a different order) are divided by thetotal number of times that each entry occurs throughout all preferencelists in the database.

In FIG. 4, the entry b of List 1 is determined to have occurred 32,761times. Similarly, entry z occurs 38,092 times, entry c occurs 5,010times, entry q occurs 898 times, entry t occurs 25,586 times and entry uoccurs 13,910 times. These values are next multiplied by an arbitraryvalue of 1,000 to yield the normalized count values at 326 of FIG. 4.The entries are then sorted according to their normalized count valuesand placed in List 2 according to their normalized count values.

List 2 shows the entries of List 1 arranged according to theirnormalized count values. In List 2, entry q is the highest ranked entrywith 64.6 as a normalized count value. Entry c is next, followed byentries b, z, t and u.

In step 222 of FIG. 3, the entries in List 1 and List 2 are weighted andcombined to generate a selection list 4. The weighting and combining isa function based upon parameter values that are modified by the operatoras discussed above. In this example, the ranking of List 1 and List 2are weighted evenly, or 50% and 50%. The manner in which the ranking isweighted is by assigning a value of 6 to the highest ranked entry, 5 tothe next highest ranked entry and so on to a value of 1 for the lowestranked entry. The values are divided by 2 since each list is weightedthe same and the values are added together for a total weighted combinedrank value.

The weighting and combining of step 222 of FIG. 3 is shown at 330 inFIG. 4. Thus, entry b is highest ranked in List 1 and has a value of 6divided by 2, which is added to value b's ranking in List 2 given avalue of 4 divided by 2. Thus, the total combined and weighted value forentry b is 5 as shown in FIG. 4. Similarly, entries z, c, q, t and u areweighted. Selection list 4 at 332 is shown with the five highest rankedentries according to the calculation performed by step 222 of FIG. 3.

As discussed above, at the completion of step 222, a selection list isgenerated for the current value of m. In the example discussed so far,selection list 4 is created with five entries (entries below the topfive are arbitrarily ignored in this implementation) and executes andproceeds to step 224 where m is decremented, the temp. list is clearedand the loop of 210-216 is reentered after step 208.

FIG. 5 shows a detailed example of the execution of steps 226 and 228 ofFIG. 3.

FIG. 5 shows selection list 4, selection list 3, selection list 2 andselection list 1. These selection lists were generated by the executionof steps 208-224 of flowchart 200 of FIG. 3 in the manner discussedabove. Once generated, execution proceeds to step 226 where theselection lists are combined according to a parameterized formula. Asbefore, the parameters can be varied by the operator of the system. Oneof the parameters is the length of each selection list which has beenset to five in the present example. Thus, even if during execution ofsteps 210-224, the number of entries of List 1, List 2 or the selectionlist exceeds 5, ultimately only the highest ranked 5 entries areincluded in the selection list. Also, the weighting of each selectionlist is shown to the right of the selection list named at 350 of FIG. 5.Thus, selection list 1 is weighted at 0.10, selection list 2 is weightedat 0.20, selection list 3 is weighted at 0.30 and selection list 4 isweighted as 0.40. The weights for the selection lists can be set by theoperator. At 352, the calculations to get the combined and weightedrankings of the entries are shown.

As before, each entry is assigned a point value based on its position inthe list. The highest ranked entry is given the value 5, the nexthighest ranked entry the value 4, and so on, down to the lowest rankedentry which has a value 1. These values are multiplied by the weight forthe selection list. So for example, entry b appears in selection list 4,selection list 3 and selection list 1, but does not appear in selectionlist 2. Thus, the calculation for entry b shown at 354 of FIG. 5multiplies the ranking of entry b in selection list 4, that is 5, withthe weight of selection list 4, that is 0.4. Similarly, a ranking of 2in selection list 3 is multiplied by the weight for selection list 3 of0.3. Since entry b is not present in selection list 2, there is noweighted value associated with that list. A ranking of 5 is multipliedby the weight 0.1 for selection list 1. Thus, the total for entry b is avalue of 3.1. Similarly, the weighted and combined values for each entrythat appears in one of the selection lists is given at 352 of FIG. 5.

The ordered weighted and combined ranking of entries is shown in list360 of FIG. 5. A predetermined number (another parameter) of the topmostentries of list 360 are presented to the user at step 228 as the resultsof the correlation routine of flowchart 200. Thus, assuming only thethree highest entries are used, the user is presented with entries c, qand b. Remembering that these letters represent recording artists, theuser is presented with three artists' names that the user is likely tobe interested in.

Thus, the discussion of FIGS. 3, 4 and 5 illustrates how the presentinvention provides a correlation method for determining selections basedon user preferences where the determined selections are likely to beones that the user is interested in.

Next, Table II and FIGS. 6A, 6B and 7 will be discussed to illustrate amethod of making “Object to Object” correlations in the presentinvention.

Table II lists the steps of a method to perform “Object to Object”correlation to generate suggested selections, or recommendations, basedon a comparison of users' preference lists that already exist in thedatabase. The steps of Table II are discussed in reference to FIGS. 6Aand 6B where a detailed discussion performing the steps of this methodin a computer system is presented.

TABLE II Object to Object Correlation One or more persons list theirfavorite objects 1 . . . n. To generate a list of similarly likedobjects based on any one particular object, the following procedure isfollowed: For every possible combination of two objects, identify thenumber of people having listed both objects. For each unique combinationof two objects, identify which of the two objects has the lesser and thegreater number of listings throughout the entire database. For eachunique combination of two objects, calculate two values. For the firstvalue, calculate the ratio of the number of people listing both objectsdivided by the total number of people listing the lesser objectthroughout the entire database. For the second value, calculate theratio of the number of people listing both objects divided by the totalnumber of people listing the greater object throughout the entiredatabase. For each unique combination of two objects, calculate a thirdvalue representing a weighted average of the first two values. Theweighted average can range from 100% of the first value and 0% of thesecond value to 0% of the first value and 100% of the second value.Create for each object a ranking, from maximum to minimum, of all otherobjects as determined by the weighted average value calculated above.Use this final ranking to provide a list of similarly liked objects.

FIGS. 6A and 6B show flowchart 400 of a routine that executes the stepsof Table 2. In FIG. 6A, routine 400 is entered at 402. It is assumedthat first and second weighting factors, referred to as “weighting1” and“weighting2” are defined at the time of entry of the routine.

The values for the weighting factors are variable at the control panel,as described above, or by other suitable means. It is further assumedthat a database of preference lists such as database 302 of FIG. 4,including lists 304, 306, 308 and 310 exists.

At step 404, a unique pair of objects is chosen. In the examplediscussed above, where objects are names of recording artists, step 404chooses two artists' names from the possible names in the database. Atstep 406 the number of lists containing both objects in the pair isdetermined. For ease of discussion, we represent this number as “L”.

Referring to FIG. 4, for example, if the pair “ad” is selected, lists304 and 310 include the pair since they each include both object “a” andobject “d” whereas lists 306 and 308 do not include the pair.

At step 408, the number of lists containing the first object of thepair, in this case “a”, is determined. This number is represented as“n1” in this discussion. In our example of FIG. 4, database 302 showsthree lists, namely lists 304, 308, and 310, containing the object “a”.Thus, n1 would be set to 3.

In step 410, the number of lists containing the second object of thepair, in this case “d”, is determined and represented as “n2”. In thiscase, n2 would also be set to 3 since lists 304, 306 and 310 eachcontain the object “d”.

At step 412 a check is made as to whether n1 is greater than n2.Assuming n1 is greater than n2, execution proceeds to step 414 where“weighting1” is used as a weighting factor with n1. The result of theweighting operation using weighting1 and n1 is represented as w1. In adescription of the preferred embodiment described below, ‘1’ is dividedby the value n1 before the weighting factor is used on the result of thedivision. However, any manner of using the weighting factors on thenumber of lists containing the predetermined object is within the scopeof the invention.

Next, step 416 is executed whereby weighting factor “weighting2” isapplied to the value n2. This result is designated as w2.

Assuming, at step 412, that n1 is not greater than n2, executionproceeds to step 418 where, in contrast to step 414, weighting factor“weighting2” is used on the value n1. The result of this is againdesignated w1. At step 420, “weighting1” is used as a weighting factoron n2 and the result is designated w2.

Thus it can be seen that, depending on the relationship of n1 as greaterthan, or less than or equal to, n2, the weighting factors are applied ina first or second order to the numbers that designate the number oflists containing first and second objects, respectively, in the selectedpair.

Flowchart 400 is continued in FIG. 6B according to the connectioncircles “A” and “B”.

FIG. 6B shows step 422 executed subsequently to either of steps 416 or420. At step 422, the results w1 and w2 are combined and stored as aranking number, “ranking#,” for the selected pair. At step 424, a checkis made as to whether there are more unique pairs to be processed. Ifso, execution returns to step 404. If not, execution continues to steps426-434 where the ranking# of each of the pairs processed above is usedto create an ordered list of selections associated with each of thepairs.

At step 426, an object from the data is selected to generate pairs. Forexample, with object “a”, possible pairs (assuming the lists 304, 306,308 and 310 form the database) are ab, ap, as, ad, al, az, ay, and aq.After executing steps 404-424, described above, each of these pairs willhave a ranking# associated with the pair.

At step 428, all occurrences of the selected object in pairs isdetermined. This is merely the list of pairs that include the object “a”as listed above. At step 430, the non-selected object in each pair isranked in a list according to the ranking# for the pair. At step 432,the list is associated with the selected object.

At step 434 a check is made to see whether there are any more objects tobe processed. If so, execution returns to step 424, where anotherunprocessed object is selected. After all the objects have beenprocessed, execution falls to step 436 where the routine of flowchart400 is exited. At the completion of the routine each unique object inthe database will have an associated ranked list. The ranked lists areindicative of the popularity of the ranked artists among users of thesystem that have also chosen the selected object, or selected artist.Thus, the ranked lists provide a way to recommend artists to a usergiven that the user prefers the selected artist.

FIG. 7 shows Table 500 to further illustrate the method described inTable 2 and FIGS. 6A and 6B.

In FIG. 7, Table 500 shows names of objects in the database along thetop and left sides of the table. Table 500 shows only a portion of theobjects in the database for ease of discussion. For each row in Table500 an object, such as object 502, is designated. At the right of therow is the ranked list associated with the object. For object 502, list504 is the associated list. Note that list 504 shows objects c, b, e,and d ranked in that order.

Next, a discussion of the derivation of the ranked lists of FIG. 7, suchas ranked list 504, is presented.

The top of Table 500 of FIG. 7 shows an object, such as object 504,associated with a number, such as number 506. Thus, in FIG. 7, object bis associated with the number 26052. This number is the frequency ofobject b in the database. That is, object b occurs in 26052 differentlists in the database. Similarly, object a occurs 24680 times, object coccurs 47 times, object d occurs 768 times and object e occurs 11298times.

Each box within the body of Table 500 represents statistics on the pairgiven by the row and column corresponding to the box. For example, box508 includes statistics on the pair of objects a and b. The top numberin the box, 15629, is the number of lists in the database that includesboth of the objects in the pair, that is, a and b. The second line inthe box shows the object of the pair that occurs most frequently inlists in the database. Thus, in box 508, object b occurs more frequentlyin the database than object a. This is verified by noting that b occurs26052 times and that a occurs 24680 times.

The number 60.0 results from dividing the number of occurrences of thepair ab by the number of lists that contain the more frequentlyoccurring object, b. In other words, 60.0 is the result of dividing15629 by 26052. This number is shown multiplied by 0.1 to yield a resultof 6.0. 0.1 is the first weighting factor and is the same for each ofthe calculations in each of the boxes of Table 500.

Similar to the above, the next line in box 508 shows the object of lessfrequent occurrence in the database, namely, object a. To the right ofthe object ‘a’ designator is the value 63.3. This value was obtained bydividing the frequency of the pair in lists in the database by thefrequency of the occurrence of object ‘a’ in lists in the database. Thatis, 63.3 is the result of dividing 15629 by 24680. The value 0.9 is thesecond weighting factor and is used to modify the calculation thatincludes the number of occurrences of the less frequently occurringobject of the pair. This gives a result of 57.0.

Finally, the result of the weighting calculations, that is 6.0 and 57.0,are combined to form a final ranking value of 63.0.

Referring to flowchart 400 of FIG. 6A, at step 404 a unique pair ofobjects is selected. In FIG. 7, this would be, for example, theselection of objects a and b at 502 and 504. At step 406, the number oflists containing both objects in the pair is determined. This is thenumber 15629 of box 508 of FIG. 7. At step 408, the number of listscontaining the first object of the pair is determined. The arbitraryfirst object of the pair is object a and the number of lists containingobject a is shown in Table 500 as 24680. Similarly, step 410 of FIG. 6Adetermines the number of lists containing the second object of the pair,or object b. In Table 500 of FIG. 7, this is the number 26052. Note thatnumbers such as 24680 and 26052 are obtained by scanning the preferencelists in the database such as preference lists 304, 306, 308 and 310 ofdatabase 302 shown on FIG. 4. This scanning can be by any means known inthe art.

At step 412, a check is made as to which of the first or second objectsin the pair occurs with the highest frequency in the lists. This check,along with steps 414-420, is used to apply first and second weightingfactors, designated weighting1 and weighting2, to a number that is basedon the frequency of occurrence of each of the objects of the pair inlists in the database. For example, weighting1 has the value 0.1 andweighting2 has the value 0.9.

Simply put, the steps 412-420 will always apply weighting1, or 0.1, tothe calculation using the number of occurrences of the more frequentlyoccurring object in the pair. Weighting factor “weighting2” is alwaysapplied to the number of occurrences of the less frequently occurringobject. Thus, in box 508, 0.1 is applied to a calculation using thenumber of occurrences of object b while 0.9 is applied to a calculationusing the number of occurrences of object a. In boxes 510, 512, and 514,object a occurs more frequently than its paired object, that is, objectc, d, or e, respectively. Thus, in each of boxes 510, 512 and 514, thecalculation involving the number of occurrences of object a is appliedwith the weighting factor of 0.1.

The result in each of the boxes, such as the result 63.0 in box 508, isused to rank the second object of the pair in a list where the list isthen associated with the first object in the pair. Thus, object c with aresult of 72.8 is ranked at the top of list 504. Similarly, objects b, eand d are ranked successively in list 504. List 504 is then associatedwith object a.

The second row of table 500 shows that object b has list 516 associatedwith it and that list 516 includes objects a, d, e and c. Similarly,objects c, d and e each have a list associated with them as shown by theobject and associated list at opposite ends of each of the rows of Table500 of FIG. 7. Note that the diagonal boxes of Table 500 need not becomputed since it is irrelevant to have a pair of the same object, suchas pairing object a with object a. Also, the lower left triangularportion of Table 500 does not need to be computed since the computationis identical for its counterpart across the main diagonal of the table.

Once the lists of table 500 have been derived, recommended selections ofobjects based on a given object can be obtained from the table. Forexample, given object a, list 504 would be accessed to returnrecommendations of objects c, b, e and d in order of descendinglikelihood of also being preferred by a user who already prefers objecta.

Thus, the method of Table 2 is a way to achieve object to objectcorrelation in the present invention.

In the foregoing specification, the invention has been described withreference to specific exemplary embodiments thereof. It will, however,be evident that various modifications and changes may be made thereuntowithout departing from the broader spirit and scope of the invention asset forth in the appended claims. For example, various programminglanguages and techniques can be used to implement the disclosedinvention. Also, the specific logic presented to accomplish tasks withinthe present invention as disclosed by the flowcharts may be modifiedwithout departing from the scope of the invention. Many such changes ormodifications will be readily apparent to one of ordinary skill in theart. The specification and drawings are, accordingly, to be regarded inan illustrative rather than a restrictive sense, the invention beinglimited only by the provided claims.

1. A method for outputting recommended preferences based onpredetermined preferences, the method executing on a computer system,the computer system including a processor, database, first input device,and an output device, wherein the database includes a plurality ofdatafiles each containing a plurality of preferences, the methodcomprising the following steps: a) accepting signals from the firstinput device to indicate a plurality of user preferences; b) using theprocessor to search the database and to determine the number of userpreferences that match preferences in a given datafile; c) using theprocessor to identify datafiles with a number of matching preferencesabove a first threshold number; d) selecting preferences from theidentified datafiles, wherein the selected preferences do not match theuser preferences; e) for each unmatching preference in the relateddatafiles, performing a submethod comprising: e1) determining a firstnumber of related datafiles in which the unmatching preference occurs;e2) determining a second number of all datafiles in the database inwhich each unmatching preference occurs; and e3) determining a thirdnumber as the proportion of the first and second determined numbers; f)determining a first ranking of said selected unmatching preferences fromthe related datafiles based on said first determined numbers; g)determining a second ranking of said selected unmatching preferencesfrom the related datafiles based on said third determined numbers; h)selecting one or more recommended preferences based on a weightedcombination of said first ranking and said second ranking; and i)outputting via the output device, the recommended preferences.
 2. Themethod of claim 1, wherein the user preferences comprise artist's name.3. The method of claim 1, wherein the user preferences comprise a titleof a movie.
 4. The method of claim 1, wherein the computer systemfurther includes a data communications network and a second inputdevice, wherein the processor, database, first input device, secondinput device and output device are coupled to the network, wherein thefirst and second user input devices are remotely located from eachother.
 5. The method of claim 1, wherein the number of preferences in adatafile is limited to 10, and wherein the first threshold number is 5.6. The method of claim 1, wherein the number of preferences in adatafile is limited to 10, and wherein only those unmatching preferencesthat also appear in 50% or more of the related datafiles are selected asrecommended preferences.
 7. A computer-readable medium having storedthereon instructions that, when executed, perform a method foroutputting recommended preferences based on predetermined preferences,the method executing on a computer system, the computer system includinga processor, database, first input device, and an output device, whereinthe database includes a plurality of datafiles each containing aplurality of preferences, the method comprising the following steps: a)accepting signals from the first input device to indicate a plurality ofuser preferences; b) using the processor to search the database and todetermine the number of user preferences that match preferences in agiven datafile; c) using the processor to identify related datafileswith a number of matching preferences above a first threshold number; d)selecting preferences from the related datafiles, wherein the selectedpreferences do not match the user preferences; e) for each unmatchingpreference in the related datafiles, performing a submethod comprising:e1) determining a first number of related datafiles in which theunmatching preference occurs; e2) determining a second number of alldatafiles in the database in which each unmatching preference occurs;and e3) determining a third number as the proportion of the first andsecond determined numbers; f) determining a first ranking of saidselected unmatching preferences from the related datafiles based on saidfirst determined numbers; g) determining a second ranking of saidselected unmatching preferences from the related datafiles based on saidthird determined numbers; h) selecting one or more recommendedpreferences based on a weighted combination of said first ranking andsaid second ranking; and i) outputting via the output device, therecommended preferences.
 8. A computer apparatus having a processor andmemory containing computer executable instructions that, when executedby the processor, perform a method for outputting recommendedpreferences based on predetermined preferences, the method executing ona computer system, the computer system further including a database,first input device, and an output device, wherein the database includesa plurality of datafiles each containing a plurality of preferences, themethod comprising the following steps: a) accepting signals from thefirst input device to indicate a plurality of user preferences; b) usingthe processor to search the database and to determine the number of userpreferences that match preferences in a given datafile; c) using theprocessor to identify related datafiles with a number of matchingpreferences above a first threshold number; d) selecting preferencesfrom the related datafiles, wherein the selected preferences do notmatch the user preferences; e) for each unmatching preference in therelated datafiles, performing a submethod comprising: e1) determining afirst number of related datafiles in which the unmatching preferenceoccurs; e2) determining a second number of all datafiles in the databasein which each unmatching preference occurs; and e3) determining a thirdnumber as the proportion of the first and second determined numbers; f)determining a first ranking of said selected unmatching preferences fromthe related datafiles based on said first determined numbers; g)determining a second ranking of said selected unmatching preferencesfrom the related datafiles based on said third determined numbers; h)selecting one or more recommended preferences based on a weightedcombination of said first ranking and said second ranking; and i)outputting via the output device, the recommended preferences.